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SYSTEM AND METHOD FOR AUTOMATIC AND ADAPTIVE USE OF 
ACTIVE NETWORK PERFORMANCE MEASUREMENT TECHNIQUES 
/ TO FIND THE FASTEST SOURCE 

5 Technical Field of the Invention 

The present invention relates to optimizing download performance from 
either a client-server based or peer-to-peer type network. 

Background of the Invention 

10 Search engines, such as Google, and Peer-to-Peer (P2P) services, such as 

Napster, gnutella, and Freenet, have made locating files and information on the 
Internet very easy. However, with tens of millions of computers coupled to the 
Internet, often there are several sources from which to download any particular file. 
However, the user has no automatic or adaptive means through which to select the 

1 5 source that will download the entire file in the least amount of time. 

In some conventional P2P apphcations, each file that is available to be 
downloaded is associated with an indicated download throughput speed of the 
communication link of the source computer on which the file is stored. However, 
the associated throughput speed is not reUable information because the throughput 

20 speed is manually specified by a user operating the computer on which the file is 
stored from the user's knowledge of the optimal throughput speed. In addition, the 
associated throughput speed is not reliable information because even when the user 
correctly specifies or indicates the throughput speed of the communication link of 
the source computer on which the file is stored, on occasion, the throughput speed 

25 varies according to changing conditions of the connnunication link. The changing 
conditions include changes in the network load and the network capacity. 
Furthermore, the time it takes to obtain a file does not correlate well with how far 
the source computer is in terms of network hops or how fast it responds to a network 
Ping. Thus, these metrics are unrehable in determining the fastest source. 

30 When another user is selecting a source from which to download a particular 

file through the P2P service, often, one of the more important criteria in selecting a 

source is the throughput speed of the communication link of the computer on which 
Attorney Docket No. 884.441US1 Client Ref. No. Pll 175 

1 



the file is stored. The other user has no ability to automatically and adaptively 
confirm, verify, test or measure the indicated throughput speed. As a result, the 
other user must trust the accuracy of the indicated throughput speed. In addition, the 
other user must manually select a computer from which to download data, which 
introduces the possibility that the other user may mis-read the indicated throughput 
speed. Moreover, in conventional P2P networks, the other user often selects the 
same source to download from when a previous download has been successful, and 
in which the other user is also unknowledgeable there is another source with better 
performance. 

As a result of the many possible reasons that the indicated throughput speed 
may vary from the actual throughput speed, there is very little correlation between 
the specified throughput speed and the actual throughput speed as shown in table 1 
below: 
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Table 1 

In addition, in conventional P2P methods, after a download is started, the 
download continues until the download is completed or until the download times 
out. There is no active monitoring of performance and the end user must manually 
select a new source if the initial download does not complete, or progresses slower 
than the user desires. 

Furthermore, conventional systems do not discriminate or distinguish 
between large and small files. Also, other technologies, such as Akamai 
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Technologies Inc. that offer edge services that cache data at the network edge so 
users don't have to go all the way to the actual server for the data, do not adapt to 
changmg network conditions, they rely on a known static network topology. 

A software product named VitalAgent from Lucent Technologies measures 

5 end-to-end network performance. VitalAgent provides measurement techniques to 
test for latency and packet loss, such as estimates of round-trip time (RTT), in which 
the time between an acknowledgment (ACK) message and a synchronous idle 
(SYN) message is measured during a Transmission Control Protocol (TCP) socket 
estabUshment. An important assumption in VitalAgent is that very little time is 

10 required by a computer to respond to the SYN packet and that most of the delay is a 
result of network latency, rather than server side delays. 

A research paper titled "Dynamic Server Selection in the Internet" by Mark 
E. Crovella and Robert L. Carter describes how to find a "good" service provider for 
WWW documents based on RTT latency. Their research shows that the number of 

15 hops does not correlate with RTT latency. However, Crovella and Carter do not use 
network throughput or packet loss in their determination of a "good" service 
provider and they do not implement their findings in an algorithmic method. 
Crovella and Carter merely disclose that latency is a "good" metric to use. In 
addition, Crovella and Carter address worldwide web (WWW) documents, and do 

20 not address P2P issues. 

Furthermore, FreeNet implements a P2P technology that uses distributive 
storage or caching technology similar to Akamai. FreeNet is a large-scale peer-to- 
peer network which pools the power of member computers around the world to 
create a massive virtual information store open to anyone to freely pubHsh or view 

25 information of all kinds. Freenet is an enhanced Open Source implementation of the 
system described by Ian Clarke's 1999 paper "A distributed decentralized 
information storage and retrieval system." However, FreeNet does not assist the end 
user find a faster source. FreeNet' s storage and caching technology is adaptive after 
several attempts from different people to acqviire the data, not adaptive to changing 

30 network conditions or transaction type. 
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For the reasons stated above, and for other reasons stated below which will 
become apparent to those skilled in the art upon reading and understanding the 
present specification, there is a need in the art for the ability to automatically and 
adaptively confirm, verify, test or measure the indicated performance of the source, 
5 There is also a need for the ability to automatically select the download source 
having the fastest throughput speed. Furthermore, there is a need to adapt the 
selection process according to the size of the file to be downloaded. Moreover, 
there is a need to actively monitor performance during the download. 

10 Brief Description of the Drawings 

FIG. 1 is a block diagram of the hardware and operating environment in 
which different embodiments of the invention can be practiced. 

FIG. 2 is a diagram illustrating a system-level overview of an embodiment 
of the invention, 

15 FIG. 3 is a flowchart of a method for managing a plurality of download 

sources, performed by a cUent according to an embodiment of the invention. 

FIG. 4 is a flowchart of a method for managing a plurality of download 
sources, performed by a client according to an embodiment of the invention, 
FIG. 5 is a flowchart of a method for selecting one of a plurahty of 
20 download sources, as in FIG. 4, in which a predetermined file size is less than a 

predetermined threshold file size, performed by a cHent according to an embodiment 
of the invention. 

FIG. 6 is a flowchart of a method for selecting one of a plurality of 
download sources, as FIG. 4, in which a predetermined file size is not less than a 
25 predetermined threshold file size, performed by a cMent according to an embodiment 
of the invention. 

FIG. 7 is a flowchart of a method for measuring the latency, as FIG. 6, in 
which a predetermined file size is not less than a predetermined threshold file size, 
performed by a client according to an embodiment of the invention. 
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FIG, 8 is a flowchart of a method for managing a plurality of download 
sources, performed by a client according to an embodiment of the invention. 

FIG. 9 is a flowchart of a method for measuring the elapsed time of a 
transmission involving each of the plurality of download sources of FIG, 8, 
5 performed by a client according to an embodiment of the invention. 

FIG, 10 is a block diagram of an apparatus for managing a plurality of 
download sources, according to an embodiment of the invention. 

FIG. 1 1 is a block diagram of a source throughput empirical performance 
measurer, according to an embodiment of the invention. 
1 0 FIG. 12 is a block diagram of a data stracture implemented in the 

management of outstanding synchronous idle and acknowledgment messages in 
transmission control protocol/Internet protocol. 

FIG. 13 is a block diagram of a data structure implemented in the 
management of global DP flow statistics. 
1 5 FIG. 14 is a block diagram of a data structure implemented in the 

management of Ethemet packet data from a network interface card driver. 

FIG. 15 is a block diagram of a data structure implemented in the 
management of Transmission Control Protocol (TCP) header information. 

20 Detailed Description of the Invention 

In the following detailed description of the preferred embodiments, reference 
is made to the accompanying drawings that form a part hereof, and in which is 
shown by way of illustration specific preferred embodiments in which the 
inventions may be practiced. These embodiments are described in sufficient detail 

25 to enable those skilled in the art to practice the invention, and it is to be understood 
that other embodiments may be utilized and that logical, mechanical and electrical 
changes may be made without departing from the spirit and scope of the present 
invention. The following detailed description is, therefore, not to be taken in a 
limiting sense, and the scope of the present invention is defined only by the claims. 
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The detailed description is divided into five sections. In the first section, the 
hardware and the operating environment in conjunction v^ith which embodiments of 
the invention may be practiced are described. In the second section, a system level 
overview of the invention is presented. In the third section, methods for an 
5 embodiment of the invention are provided. In the fourth section, apparatus for an 
embodiment of the invention are provided. Finally, in the fifth section, a particular 
TCP/IP-based implementation of the invention is described. 

A system and method for automatic and adaptive use of active network 
performance measurement techniques to find the fastest source is described herein. 

10 

Hardware and Operating Environment 
FIG. 1 is a block diagram of the hardware and operating environment 100 in 
which different embodiments of the invention can be practiced. The description of 
FIG. 1 provides an overview of computer hardware and a suitable computing 

1 5 environment in conjunction with which some embodiments of the present invention 
can be implemented. Embodiments of the present invention are described in terms 
of a computer executing computer-executable instructions. However, some 
embodiments of the present invention can be implemented entirely in computer 
hardware, for instance on the network card but not limited to the network card, in 

20 which the computer-executable instructions are implemented in read-only memory. 
One embodiment of the invention can also be implemented in client/server 
computing environments where remote devices that are linked through a 
communications network perform tasks. Program modules can be located in both 
local and remote memory storage devices in a distributed computing environment. 

25 Computer 1 10 is operatively coupled to display device 1 12, pointing device 

1 15, and keyboard 116. Computer 110 includes a processor 118, commercially 
available fi*om Intel(g), random-access memory (RAM) 120, read-only memory 
(ROM) 122, and one or more mass storage devices 124, and a system bus 126, that 
operatively couples various system components including the system memory to the 

30 processing unit 118. Mass storage devices 124 are more specifically types of 
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nonvolatile storage media and can include a hard disk drive, a floppy disk drive, an 
optical disk drive, and a tape cartridge drive. The memory 120, 122, and mass 
storage devices, 124, are types of computer-readable media. A user enters 
commands and information into the computer 110 through input devices such as a 

5 pointing device 1 1 5 and a keyboard 116. Other input devices (not shown) can 
include a microphone, joystick, game pad, satellite dish, scanner, or the like. The 
processor 118 executes computer programs stored on the computer-readable media. 
Embodiments of the present invention are not limited to any type of computer 1 10. 
In varying embodiments, computer 110 comprises a PC-compatible computer, a 

10 MacOS®-compatible computer or a UNIX®-compatible computer. The 
construction and operation of such computers are well known within the art. 

Furthermore, computer 110 can be communicatively connected to the 
Internet 130 via a communication device 128. Internet 130 connectivity is well 
known within the art. In one embodiment, a communication device 128 is a modem 

1 5 that responds to communication drivers to connect to the Internet via what is known 
in the art as a "dial-up connection." In another embodiment, a communication 
device 128 is an Ethemet® or similar hardware (network) card connected to a local- 
area network (LAN) that itself is connected to the Internet via what is known in the 
art as a "direct connection" (e.g., Tl line, etc.). 

20 Computer 1 1 0 can be operated using at least one operating environment to 

provide a graphical user interface including a user-controllable pointer. Such 
operating environments include operating systems such as versions of the Microsoft 
Windows® and Apple MacOS® operating systems well known in the art. 
Embodiments of the present invention are not limited to any particxilar operating 

25 environment, however, and the construction and use of such operating environments 
are well known within the art. Computer 1 1 0 can have at least one web browser 
application program executing within at least one operating environment, to permit 
users of computer 1 10 to access intranet or Internet world-wide-web pages as 
addressed by Universal Resource Locator (URL) addresses. Such browser 
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application programs include Netscape Navigator® and Microsoft Internet 
Explorer®. 

Display device 1 12 permits the display of information, including computer, 
video and other information, for viewing by a user of the computer. Embodiments 
5 of the present invention are not limited to any particular display device 1 12. Such 
display devices include cathode ray tube (CRT) displays (monitors), as well as flat 
panel displays such as liquid crystal displays (LCD's), Display device 1 12 is 
connected to the system bus 126. In addition to a monitor, computers typically 
include other peripheral input/output devices such as printers (not shown), speakers, 

10 pointing devices and a keyboard. Speakers 113 and 114 enable the audio output of 
signals. Speakers 113 and 1 14 are also connected to the system bus 126. Pointing 
device 115 permits the control of the screen pointer provided by the graphical user 
interface (GUI) of operating systems such as versions of Microsoft Windows®. 
Embodiments of the present invention are not limited to any particular pointing 

15 device 115. Such pointing devices include mouses, touch pads, trackballs, remote 
controls and point sticks. Finally, keyboard 116 permits entry of textual information 
into computer 1 10, as known within the art, and embodiments of the present 
invention are not limited to any particular type of keyboard. 

The computer 110 can operate in a networked environment using logical 

20 connections to one or more remote computers, such as remote computer 150, These 
logical connections are achieved by a communication device coupled to, or a part 
of, the computer 110. Embodiments of the present invention are not limited to a 
particular type of communications device. The remote computer 150 can be another 
computer, a server, a router, a network PC, a chent, a peer device or other common 

25 network node. The logical coimections depicted in FIG. 1 include a local-area 
network (LAN) 151 and a wide-area network (WAN) 152. Such networking 
environments are commonplace in offices, enterprise-wide computer networks, 
intranets and the Internet. 

When used in a LAN-networking environment, the computer 110 and 

30 remote computer 1 50 are connected to the local network 151 through a network 
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interface or adapter 153, which is one type of communications device. When used 
in a conventional WAN-networking environment, the computer 110 and remote 
computer 150 communicate with a WAN 152 through modems (not shown). The 
modem, which can be internal or extemal, is connected to the system bus 126. In a 
5 networked environment, program modules depicted relative to the computer 1 10, or 
portions thereof, can be stored in the remote memory storage device. 

System Level Overview 
FIG. 2 is a block diagram that provides a system level overview of the 

1 0 operation of illustrated embodiments of the present invention. Embodiments of the 
invention are described as operating in a multi-processing, multi-threaded operating 
environment on a computer, such as computer 1 10 in FIG. 1 . 

System 200 includes a source throughput performance measurer 205. In one 
embodiment, the throughput performance measurer 205 is a source latency measurer 

15 (not shown). The source throughput performance measurer 205 receives a list of 
sources 210. The hst of sources 210 identifies sources 220 that have been identified 
as sources of a data file (not shown). The data file is identical on each of the 
sources. The source throughput performance measurer 205 transmits data to the 
sources 220. Each source responds by transmitting a response. The source 

20 throughput performance measurer 205 receives the responses, and measures the 
elapsed time between the transmittal of the data, and corresponding response. The 
list of sources 210 is modified to indicate the performance measurements, yielding a 
list of measured sources 260. The source throughput performance measurer 205 
enables the ability to confirm, verify, test or measure the associated speed of 

25 download sources. The list of measured sources 260 is then transmitted to a source 
selector 270 that selects a source 275 from the Ust of measured sources 260 in 
reference to the performance measurements. The source selector 270 selecting a 
source 275 firom the list of measured sources 260 in reference to the performance 
measurements enables system 200 to automatically select the download source 

30 having the fastest throughput speed. 
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A downloader (not shown) initiates a download of the data file from the 
source indicated by the selected source 275. 

The system level overview of the operation of an embodiment of the 
invention has been described in this section of the detailed description. The present 
invention is a system and method for automatic and adaptive use of active network 
performance measurement techniques to find the fastest source. While the invention 
is not limited to any particular downloader, for sake of clarity a simpHfied source 
throughput performance measurer 205, list of sources 210, sources 220, Hst of 
measured sources 260, source selector 270, have been described. 



Methods of an Embodiment of the Invention 
In the previous section, a system level overview of the operation of an 
embodiment of the invention was described. In this section, the particular methods 
performed by the server and the clients of such an embodiment are described by 

15 reference to a series of flowcharts. Describing the methods by reference to a 
flowchart enables one skilled in the art to develop such programs, firmware, or 
hardware, including such instructions to carry out the methods on suitable 
computerized chents (the processor of the clients executing the instructions fi:om 
computer-readable media). Similarly, the methods performed by the server 

20 computer programs, firmware, or hardware are also composed of computer- 
executable instructions. Methods 300-800 are performed by a client program 
executing on, or performed by firmware or hardware that is a part of, a computer, 
such as computer 1 1 0 in FIG. 1 . 

FIG. 3 is a flowchart of a method300 for managing a plurality of download 

25 sources, performed by a cHent according to an embodiment of the invention. 

Method 300 includes obtaining 310 a list of possible sources. In one 
embodiment, the list is obtained fi-om a peer-to-peer client program that has 
previously submitted a request for a list of sources of a particular file. Action 310 
obtains the list in response to the request. One example of a peer-to-peer client 

30 program is the Napster cUent. In the operation of the Napster cHent, the user 
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identifies the criteria for files. The criteria includes artist name and/or title of 
composition. The Napster client generates a request for files matching the criteria, 
the Napster client transmits the request to a centraUzed database server, which 
identifies the matching files, and transmits the list of possible sources for files that 
5 match the criteria. 

Subsequently, method 300 also includes establishing 320 a socket 
connection to each source in the list. One example of establishing a socket 
connection is opening a socket with the connectQ function call from the winsock32 
API. 

10 Thereafter, method 300 includes determining 330 the type of data transfer 

transaction of the file. Through the action of determining the type of data transfer 
330, method 300 enables adaptation of the selection process according to the size of 
the file to be downloaded. 

If the result of the determining 330 indicates that the type of data transfer is 

15 in reference to a small file or resource sharing, then the first source that responds to 
the estabhshment of a socket connection is identified 340, and the data transfer of 
the file from the identified source is initiated 350. Through the action of identifying 
340 the first source that responds to the establishment of a socket connection, 
method 300 enable automatic selection of the download source having the lowest 

20 latency or round-trip time (RTT), 

If the result of the determining 330 indicates that the type of data transfer is 
in reference to a large file or multimedia data, then the latency of each of the source 
in the list is measiu'ed 360, and the source associated with the least latency is 
identified 370. One example of the measure of the latency is the RTT as the 

25 difference between a synchronous idle (SYN) message and an acknowledgment 
(ACK) message. Other examples of the measuring 360 include confirming 
verifying and/or testing. Through the action of measuring 360 the latency of each 
source, method 300 enables confirming, verifying, testing or measuring the 
associated speed of download sources. In one example of identifying 370 the source 

30 associated with the least latency, the list is sorted in order of length of latency. 
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Through the action of identifying 370 the source having the least latency, method 
300 enables automatic selection of the download source having the fastest 
throughput speed* 

In one embodiment of actions 340 and 370, where a source is identified, the 
5 identification includes an Internet Protocol (EP) address. In yet another embodiment 
of actions 340 and 370, the identification is a uniform resource locator (URL), 
Thereafter, method 300 includes initiating a transfer of the file fi:om the 
identified source 380. During the transfer, performance of the transfer is measured 
390. In one embodiment, the performance of transfer of large files is measured in 

1 0 regards to throughput. In another embodiment, the performance of multimedia data 
is measured in regards to data integrity, such as packet loss. Thereafter, the 
measurement of performance is compared 395 to a predetermined threshold value. 
If the measurement of performance falls below the predetermined threshold value, 
the method continues with identifying the source associated with the least latency 

15 370. Otherwise, the transfer completes. Action 390 enables active monitor of 
performance during the download. Action 395 enables adaptive compensation for 
dynamic network performance. 

FIG. 4 is a flowchart of a method 400 for managing a plurality of download 
sources, performed by a client according to an embodiment of the invention. 

20 Method 400 includes obtaining 410 a list comprising a plurality of 

identification of sources. In one embodiment, sources include source- 1 1015, 
source-2 1020, and source-n 1025 in FIG. 10. In another embodiment, identification 
of the sources includes an Internet Protocol (IP) address. In yet another 
embodiment, identification is a uniform resource locator (URL). In still another 

25 embodiment, the sources are obtained fi-om the list of sources 210 in FIG. 2 and the 
list of sources 1010 in FIG. 10. 

Thereafter, method 400 includes initiating 420 a plurality of socket 
connections. The plurality of socket connections include one socket connection for 
each of the plurality of sources, yielding a plurality of initiated socket connections. 
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One example of an initiated socket connection is a SYN message. The destination 
address used in the socket connection is the identification obtained in action 410. 

In yet another embodiment of method 400, the throughput speed is a 
download speed. 

5 Subsequently, method 400 includes receiving 430 a response for the each of 

the plurality of initiated socket connections. The receiving 430 yields a pluraUty of 
responses. One example of a response is an ACK message. 

In varying embodiments, the fastest source of the plurahty of sources is 
selected 440 in reference to a predetermined file size and in reference to the 

10 response, as disclosed below in FIG. 5 and 6. The action of selecting 440 the first 
source that responds to the establishment of a socket connection enables method 400 
to confirm, verify, test or measure the associated speed of download sources, and to 
automatically select the download source having the fastest throughput speed. 
FIG. 5 is a flowchart of a method 500 for selecting one of a plurality of 

15 dovraload sources, as in action 440 in FIG. 4, in which a predetermined file size is 
less than a predetermined threshold file size, performed by a client according to an 
embodiment of the invention. In general, the threshold file size that distinguishes 
small files firom large files can be based on the available bandwidth of the user 
requesting the file. For example, for a client computer that has a 56Kbps modem, 

20 the threshold is approximately 560K bits or 70K bytes because transmission of that 
file would transpire in approximately ten seconds. In another embodiment, the 
threshold size is a configuration parameter set by a user of the client computer. 

Method 500 includes selecting 510 the source associated with the response 
that is received first. The action of selecting 510 the fu-st source that responds to the 

25 establishment of a socket connection enables automatic selection of the download 
source having the fastest throughput speed. 

FIG. 6 is a flowchart of a method 600 for selecting one of a pluraUty of 
download sources, as in action 440 in FIG. 4, in which a predetermined file size is 
not less than a predetermined threshold file size, performed by a client according to 

30 an embodiment of the invention. 
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Method 600 includes measuring 610 the latency of each of the plurahty of 
download sources. The action of measuring 610 the latency of each source enables 
confirming verifying, testing or measuring the associated speed of download 
sources, 

5 Subsequently, method 600 includes selecting 620 a source in reference to the 

latency of each of the plurality of download sources. Through the action of 
selecting 620 a source in reference to the latency of each of the plurahty of 
download sources, method 600 enables automatic selection of the download source 
having the fastest throughput speed. 
10 FIG, 7 is a flowchart of a method 700 for measuring the latency, as in action 

610 in FIG. 6, in which a predetermined file size is not less than a predetermined 
threshold file size, performed by a client according to an embodiment of the 
invention. 

Method 700 includes storing 710 a timestamp of each of the plurality of 

15 initiating socket connections. A timestamp includes a time, and optionally, a date. 
One example of an initiating socket call is a SYN message. Thereafter, method 700 
includes storing 720 a timestamp of each of the plurality of responses. One example 
of a response is an ACK message. Subsequently, method 700 includes determining 
730 the latency of each of the plurality of sources firom the differences between the 

20 time and date of each of the plurality of responses, that are stored in action 720, and 
the time and date of each of the plurality of initiating socket connections that are 
stored in action 710. 

Method 700 enables confirming, verifying, testing or measuring the 
associated speed of download sources. 

25 FIG. 8 is a flowchart of a method 800 for managing a plurality of download 

sources, performed by a chent according to an embodiment of the invention. 

Method 800 includes determining 810 an empirical measurement of a 
throughput speed of each of the plurality of download sources. In one example, 
action 810 is performed by the source throughput performance measurer 205 in FIG. 

30 2. In another example, action 810 is performed by the source throughput empirical 
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performance measxirer 1005 in FIG. 10. Examples of download sources include 
source-1 1015, source-2 1020, and source-n 1025 in FIG. 10. In another 
embodiment, identifications of the download sources, such as an Internet Protocol 
(IP) address, of the download sources are received through the list of sources, such 
5 as the list of sources 210 in FIG. 2 or the list of sources 1010 in FIG. 10. hi yet 
another embodiment of method 800, throughput speed is a download speed. 
Determining 810 the empirical measurement of a throughput speed of each of the 
plurality of download sources enables confirming, verifying, testing or measuring 
the associated speed of download sources. 

10 The empirical measurement can be determined from a number of sources. In 

varying embodiments, such as one or more third party sources, a local source, or by 
direct measurement. A third-party source is a source operated by a party other than 
the party is operating the apparatus, system or method of the present invention. 
Where the source is a third party source, the third-party source has previously 

1 5 determined and stored the empirical measurement of the throughput. In this 
embodiment, determining the empirical measurement includes obtaining an 
empirical measurement of a throughput speed of each of the plurality of download 
sources from one or more third-party sources. Where the source is a local source, a 
local process has previously determined and stored the empirical measurement of 

20 the throughput. In this embodiment, the determining 810 includes obtaining an 
empirical measurement of a throughput speed of each of the plurahty of download 
sources from a local source. In one example, the empirical measurement is stored 
on persistent storage 1065 in FIG. 10. Where the source of the empirical 
measurement is direct measurement, the empirical measurement is performed in a 

25 substantially real time manner and the determining includes measuring a throughput 
speed of each of the plurality of download sources. 

The empirical measurement for each source is calculated from the time 
difference between a test message that is transmitted to each source, and a response 
to the test message that is received from each source. The time difference is the 

30 elapsed time. Examples of test messages are such as test-1 1030, test-2 1035, and 
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test-n 1040 in FIG. 10, Examples of responses include responsel 1045, response2 
1050, and response-n 1055 in FIG. 10. Measuring the elapsed time difference is 
described further in FIG. 9. 

Thereafter, method 800 includes selecting 820 a source in reference to the 
5 empirical measurement of the throughput speed of each of the pluraHty of download 
sources. In one example, action 820 in performed by source selector 270 in FIG. 2. 
In another example, action 820 in performed by source selector 1070 in FIG. 10. 
Examples of a selected source include the selected source 275 in FIG. 2 and the 
selected source 1075 in FIG. 10. In varying embodiments, a source is selected in 
1 0 reference to the empirical measurements which can include round trip time, packet 
loss, past results, number of other people connected to the source, 
security/encryption of the source, and trustworthiness of the source. The action of 
selecting 820 a source in reference to the empirical measurement enable automatica 
selection of the download source having the fastest throughput speed. 
15 FIG. 9 is a flowchart of a method 900 for measuring the elapsed time of a 

transmission involving each of the plurality of download sources, as in action 810 of 
FIG. 8, performed according to an embodiment of the invention. 

Method 900 includes initiating 910 a transmission to a download source of 
the plurality of download sources. Examples of transmission include test- 1 1030, 
20 test-2 1035, and test-n 1040 in FIG. 10. Examples of download sources include 
source-1 1015, source-2 1020, and source-n 1025 in FIG. 10. 

Method 900 includes recording 920 the time of transmission. The time is the 
current time and date as indicated by the processor, such as processor 1 1 8 in FIG. 1 . 

Thereafter, method 900 also includes receiving 930 a response to the 
25 transmission from the download source. Examples of responses include a response 
responsel 1045, response2, 1050, and response-n 1055 in FIG. 10. 

Method 900 also includes recording 940 the receipt time from the current 
date and time. 

Subsequently, method 900 includes determining 950 the throughput speed of 
30 the download source from the difference between the receipt time and the 
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transmission time. The receipt time is subtracted from the transmission time, 
yielding the total throughput time. Where the size of the transmission and response 
is equal for each source tested, the throughput time can be used as a proxy for the 
throughput speed without further computation. However, in another embodiment, 
5 where the size of the transmission and response is not equal for each source tested, 
the throughput time must be divided into the transmission size for each source in 
order to determine the throughput speed for each source. 

In one embodiment, methods 300-900 are implemented as a computer data 
signal embodied in a carrier wave, that represents a sequence of instructions which, 
10 when executed by a processor, such as processor 1 18 in FIG. 1, cause the processor 
to perform the respective method. 

In another embodiment, methods 300-900 are implemented on a computer- 
accessible medium having executable instructions capable of directing a processor, 
such as processor 1 18 in FIG. 1, to perform the respective method. 

15 

Apparatus Implementation 
Referring to FIGS. 10-1 1, a particular implementation of the invention is 
described in conjunction with the system overview in FIG. 2 and the methods 
described in conjunction with FIGS. 3-9. 

20 FIG. 10 is a block diagram of an apparatus 1000 for managing a plurality of 

download sources, according to an embodiment of the invention. 

Apparatus 1000 includes a source throughput empirical performance 
measurer 1005. The source throughput empirical performance measurer 1005 
receives a list of sources 1010. The list of sources 1010 identifies sources, such as 

25 source-1 1015, source-2 1020, and source-n 1025, that have been determined as a 
source of a data file (not shown). The data file is identical on each of the sources. 
The source throughput empirical performance measurer 1005 transmits test data, 
such as test-1 1030, test-2 1035, and test-n 1040, to each of the sources. Each 
source responds by transmitting a response, such as responsel 1045, response2, 

30 1050, and response-n 1055. The sowce throughput empirical performance measurer 

Attorney Docket No. 884,441US1 Client Ref. No, P11175 

17 



1005 receives the responses, and measures the elapsed time between the transmittal 
of the test data, and corresponding response. The Ust of sources 1010 is modified to 
indicate the performance measurements, yielding a hst of measured sources 1060. 
The source throughput empirical performance measurer 1005 enables confirming, 
5 verifying, testing or measuring the associated speed of download sources. In one 
embodiment, the list of measured sources 1060 is stored in persistent storage 1065 
for future reference. 

The Hst of measured sources 1060 is then transmitted to a source selector 
1070 that selects a source 1075 from the list of measured source 1060 in reference 

10 to the performance measurements. The source selector 1070 selectiQg a source 
1075 from the Ust of measured source 1060 in reference to the performance 
measurements enables apparatus 1000 to automatically select the download source 
having the fastest throughput speed. 

A downloader (not shown) initiates a download of the data file fi^om the 

1 5 source indicated by the selected source 1 075 . 

FIG. 1 100 is a block diagram of a source throughput empuical performance 
measurer 1 100, according to an embodiment of the invention. The source 
throughput empirical performance measurer 1 100 is one embodiment of the source 
throughput empirical performance measurer 1005, shown in FIG. 10. 

20 The measurer 1 100 includes a transmitter 1 1 10 of a message to a download 

source. The transmitter 1110 receives an Intemet protocol (DP) address 1 105 of the 
download source. The download source is one of the plurality of download sources. 

The measurer 1 100 also includes a recorder 1 120 of the time 1 125 of a 
transmission of a message. The recorder 1 120 is operably coupled to the transmitter 

25 1 1 10. In one embodiment, the message further comprises a TCP/IP synchronized 
idle (SYN) message. 

The measurer 1 100 also includes a receiver 1 130 of a response to the 
transmission from the download source. The receiver 1 130 is operably coupled to 
the transmitter 1 1 10. In one embodiment, the response further comprises a TCP/IP 

30 acknowledgment (ACK) message. 
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The measurer 1 100 also includes a recorder 1 140 of the time 1 145 of receipt 
of the response. 

The measurer 1 100 also includes a determiner 1 150 of the throughput speed 
of the download source. The speed is determined from the difference between the 
5 receipt time 1 145 and the transmission time 1 125. 

The apparatus 1000 component of the source throughput empirical 
performance measurer 1005, and the apparatus 1 100 components of the source 
selector 1070, transmitter 1 1 10, recorder 1 120, receiver 1 130, recorder 1 140, and 
determiner 1 150 can be embodied as computer hardware circuitry or as a computer- 

10 readable program, or a combination of both. In another embodiment, the source 
throughput empirical performance measurer 1005, and the apparatus 1 100 
components of the source selector 1070, transmitter 1110, recorder 1120, receiver 
1130, recorder 1 140, and detemiiner 1 150 are implemented in an appUcation service 
provider (ASP) apparatus. 

1 5 More specifically, in the computer-readable program embodiment, the 

programs can be structured in an object-orientation using an object-oriented 
language such as Java, Smalltalk or C-H-, and the programs can be structured in a 
procedural-orientation using a procedural language such as COBOL or C. The 
software components communicate in any of a number of means that are well- 

20 known to those skilled in the art, such as application program interfaces (A.P.L) or 
interprocess communication techniques such as remote procedure call (R-P-C), 
common object request broker architecture (CORBA), Component Object Model 
(COM), Distributed Component Object Model (DOOM), Distributed System Object 
Model (DSOM) and Remote Method Invocation (RMI). The components execute 

25 on as few as one computer as in computer 1 10 in FIG. 1, or on at least as many 
computers as there are components. 

TCP/IP Ethemet Implementation 
In one embodiment of the present invention, communications with the 
30 sources are implemented using TCP/IP. 
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FIG. 12 is a block diagram of a data structure 1200 implemented in the 
management of outstanding synchronous idle (SYN) and acknowledgment (ACK) 
messages in transmission control protocol/Internet protocol (TCP/IP). Data 
structure 1200 is used in action 360 of method 300, action 610 of method 600, 
5 action 730 of method 700, and in all parts of apparatus 1 100 used to measure 
latency. 

Data structure 1200 includes fields storing data representing a source IP 
address (SrcEP) 1210, a destination IP (DstIP) address 1220, a source port (SrcPort) 
address 1230, a destination port (DstPort) address 1240, an expected 

10 acknowledgement (ExpectedAck) indicator 1250, a synchronrous idle time 

(SynTime) indicator 1260, a pointer to a previous outstanding synchronous idle data 
structure (PrevSyn) 1270, and a pointer to a subsequent outstanding synchronous 
idle data structure 1280 (NextSyn). In varying embodiments, the fields SrcIP, 
DstIP, SrcPort, DstPort, ExpectedAck, and SynTime are implemented as data types: 

15 unsigned long, unsigned long, imsigned short, unsigned short, unsigned long, 
respectively. 

FIG. 13 is a block diagram of a data structure 1300 implemented in the 
management of global IP flow statistics. Data structure 1300 is used in action 360 
of method 300, action 610 of method 600, action 730 of method 700, and in all parts 

20 of apparatus 1 100 used for measuring latency. 

Data structure 1300 includes fields storing data representing source IP 
address (SrcIP) 1310, a destination IP (DstIP) address 1320, a source port (SrcPort) 
address 1330, and a destination port (DstPort) address 1335. The Source port 
(SrcPort) address 1330 the destination port (DstPort) address 1335 are generally 

25 used by the higher level protocols such as TCP and UDP). Data structure 1300 
also includes a protocol 1340. Examples of protocol 1340 include TCP/IF, UDP 
and ICMP). Data structure 1300 fiirther includes a padding 1345 of unused bytes 
to create a packet of an even multiple of 32 bits or 4 bytes. Data structure 1300 
also includes least recently used (LRU) 1350, packets per second 1355 that indicates 

30 the number of packets that have accumulated since the last time it was 
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sampled, bytes 1360 that rq)resents the number of bytes that have accumulated 
since a prior sampling, loss 1365 that represents the number of packets lost, a 

sequence number (SeqNum) 1370, an acknowledgement (Ack) 1375, a 
measurement of latency in microseconds (Lat_uSec) 1380, a time when a socket 
5 connection was first made (connectTime) 1385, and a time when the socket was last 
used (LastDataTime) 1390. In varying embodiments, the SrcIP 1310, DstIP 1320, 
SrcPort 1330, DstPort 1335, protocol 1340, padding 1345, LRU 1350, packets 1355, 
bytes 1360, loss 1365, SeqNum 1370, Ack 1375, Lat_uSec 1380, connectTime 
1385, and LastDataTime 1390 are implemented as data types: unsigned long, 

10 unsigned long, unsigned short, unsigned short, unsigned short, unsigned short, 
unsigned long, unsigned long, imsigned long, unsigned long, unsigned long, 
unsigned long, unsigned long, large integer, large integer; respectively. 

FIG. 14 is a block diagram of a data structure 1400 implemented in the 
management of Ethemet packet data from a network interface card (NIC) driver. 

15 Data structure 1400 is used in action 205 of method 200, action 395 of method 300, 
action 810 of method 800, action 920, 930, and 940 of method 900, and in part 1005 
of apparatus 1000 used for measuring throughput with associated timestamps and 
data. 

Data structure 1400 includes fields storing data representing a timestamp 
20 1410, header 1420, datasize (SIZE)1430, and data 1440. In varying embodiments, 
the timestamp 1410, header 1420, datasize (SIZE)1430, and data 1440 are 
implemented as data types: large integer, imsigned char header, imsigned long, and 
unsigned char; respectively. 

FIG. 15 is a block diagram of a data structure 1500 implemented in the 
25 management of TCP header information. Data structure 1 500 is used in action 360 
of method 300, action 610 of method 600, action 730 of method 700, and in all parts 
of apparatus 1 100 used for measuring latency. 

Data structure 1 500 includes fields storing data representing a source port 
(SrcPort) 1510, a destination port (DstPort) 1520, a sequence number (SEQNUM) 
30 1530, an acknowledgment (ACK) 1540, a DataOff flags (DOF) 1550 that represents 
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the TCP header size in 4 byte quantities, an advertised window (AW) 1560 that 
represents the maximum number of bytes the sender is willing to accept, a 

checksum (C) 1 570, an urgent pointer (UrgPtr) 1580 that represents a pointer to 
urgent data, and options data (OD) 1590 that represents and end-of-option list, 
5 No-Operation, or Maximum Segment Size. In varying embodiments, the SrcPort 
1510, DstPort 1520, SEQNUM 1530, ACK 1540, DOF 1550, AW 1560, C 1570, 
UrgPtr 1580, and the OD 1590 are implemented as data types: unsigned short, 
unsigned short, unsigned long, unsigned long, unsigned short, unsigned short, 
unsigned short, unsigned short, unsigned char; respectively. Data Structure 1500 is 

10 a well known TCP data structure. 

A system and method for automatic and adaptive use of active network 
performance measurement techniques to find the fastest source has been described. 
Although specific embodiments have been illustrated and described herein, it will be 
appreciated by those of ordinary skill in the art that any arrangement which is 

1 5 calculated to achieve the same purpose may be substituted for the specific 

embodiments shown. This application is intended to cover any adaptations or 
variations of the present invention. For example, although described in procedural 
terms, one of ordinary skill in the art will appreciate that the invention can be 
implemented in an object-oriented design environment or any other design 

20 environment that provides the required relationships. 

Systems and methods are provided through which automatic and adaptive use 
of active network performance measurement techniques identifies a fastest 
download source. Before a download source is selected, empirical measurements of 
the download speed are performed. For small files, the first source to acknowledge 

25 an open socket connection is used as an indication of the fastest download source. 
For large files, a test download is performed as the empirical measurement to 
determine the fastest download source. 

In particular, one of skill in the art will readily appreciate that the names of 
the methods and apparatus are not intended to limit embodiments of the invention. 

30 Furthermore, additional methods and apparatus can be added to the components, 
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functions can be rearranged among the components, and new components to 
correspond to future enhancements and physical devices used in embodiments of the 
invention can be introduced without departing from the scope of embodiments of 
the invention. One of skill in the art will readily recognize that embodiments of the 
5 invention are appUcable to future communication devices, different file systems, and 
new data types. 

The terminology used in this application with respect to is meant to include 
all database and communication environments and altemate technologies which 
provide the same fimctionality as described herein. Therefore, it is manifestly 
1 0 intended that this invention be limited only by the following claims and equivalents 
thereof 
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